An Optimal Processor Replacement Scheme for Efficient Communication of Runtime Data Redistribution
نویسندگان
چکیده
AbstractDynamic data distribution is used to enhance data locality and algorithm performance with reducing inter-processor communication in data parallel programs on distributed memory multi-computers. Since the exchange of data is performed at run-time, there is a performance tradeoff between the efficiency of the new data decomposition for a subsequent phase of an algorithm and the cost of exchanging data among processors. In this paper, we present an Optimal Processor Replacement (OPR) scheme to minimize data transmission cost for general BLOCK-CYCLIC data redistribution. The main idea of the proposed techniques is to employ a size oriented greedy matching method or a maximum bipartite matching theory for exploring an ideal one-to-one mapping between logical processors. Based on the matching policy, a realigned sequence of destination processors can be derived and is then used to perform data redistribution in the destination phase. A significant improvement of this approach is that OPR achieves the highest rate of data remain in local space and leading minimum inter-processor communication. The optimal processor replacement scheme can handle array redistribution with arbitrary source and destination distribution in BLOCK-CYCLIC type and can be applied to multidimensional arrays. To evaluate the performance of the proposed technique, we have implemented the OPR method on an SMP Cluster with 24 nodes. The theoretical analysis and experimental results show that our technique provides considerable improvement for runtime data redistribution.
منابع مشابه
An efficient certificateless signcryption scheme in the standard model
Certificateless public key cryptography (CL-PKC) is a useful method in order to solve the problems of traditional public key infrastructure (i.e., large amount of computation, storage and communication costs for managing certificates) and ID-based public key cryptography (i.e., key escrow problem), simultaneously. A signcryption scheme is an important primitive in cryptographic protocols which ...
متن کاملImproving communication scheduling for array redistribution
Many scientific applications require array redistribution when the programs run on distributed memory parallel computers. It is essential to use efficient algorithms for redistribution, otherwise the performance of the programs will degrade considerably. The redistribution overheads consist of two parts: index computation and inter-processor communication. If there is no communication schedulin...
متن کاملAn Incentive-Aware Lightweight Secure Data Sharing Scheme for D2D Communication in 5G Cellular Networks
Due to the explosion of smart devices, data traffic over cellular networks has seen an exponential rise in recent years. This increase in mobile data traffic has caused an immediate need for offloading traffic from operators. Device-to-Device(D2D) communication is a promising solution to boost the capacity of cellular networks and alleviate the heavy burden on backhaul links. However, dir...
متن کاملAssessing the cost of redistribution followed by a computational kernel: Complexity and performance results
The classical redistribution problem aims at optimally scheduling communications when reshuffling from an initial data distribution to a target data distribution. This target data distribution is usually chosen to optimize some objective for the algorithmic kernel under study (good computational balance or low communication volume or cost), and therefore to provide high efficiency for that kern...
متن کاملContention-Free Communication Scheduling for Irregular Data Redistribution in Parallelizing Compilers
The data redistribution problems on multi-computers had been extensively studied. Irregular data redistribution has been paid attention recently since it can distribute different size of data segment of each processor to processors according to their own computation capability. High Performance Fortran Version 2 (HPF-2) provides GEN_BLOCK data distribution method for generating irregular data d...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2004